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ENGINEERING OF METABOLIC CONTROL 

Government Rights 

This invention was made with Government support under Grant DE-FG03- 
98ER20298 with the U.S. Department of Energy and Grant BES-9814097 with the 
National Science Foundation. The Government has certain rights in the invention. 

5 Cross Reference 

This application claims priority from U.S. provisional application serial number 
60/145,801, filed July 27, 1999. 

Background of the Invention 

The use of recombinant DNA technology has allowed the engineering of host 
10 cells to produce desired compounds, such as polypeptides and secondary metabolites. 
The large scale production of polypeptides in engineered cells allows for the production 
of proteins with pharmaceutical uses and enzymes with industrial uses. Secondary 
metabolites are products derived from nature that have long been known for their 
biological and medicinal importance. Because of the structural complexity inherent in 
15 such molecules, traditional chemical synthesis often requires extensive effort and the use 
of expensive precursors and cofactors to prepare the compound. In recent years, the 
expression of heterologous proteins in cells has facilitated the engineering of 
heterologous biosynthetic pathways in microorganisms to produce metabolites from 
inexpensive starting materials. In this manner, a variety of compounds have been 
20 produced, including polyketides, p-lactam antibiotics, monoterpenes, steroids, and 
aromatics. 



Summary of the Invention 

The invention is based, in part, on the discovery that production of heterologous 
25 polypeptides and metabolites can be enhanced by the regulated expression of the 
polypeptide (e.g., a biosynthetic enzyme) using a promoter which is regulated by the 
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concentrations of a second metabolite, e.g. acetyl phosphate. The term "heterologous" 
refers to a polypeptide or metabolite which is introduced by artifice. A heterologous 
polypeptide or metabolite can be identical to endogenous entity that is naturally present. 
The term "metabolite" refers to a organic compound which is the product of one or more 

5 biochemical reactions. A metabolite may itself be a precursor for other reactions. A 
secondary metabolite is a metabolite derived from another. 

Accordingly, in one aspect, the invention features a bacterial host cell containing 
a nucleic acid sequence comprising a promoter and a nucleic acid sequence encoding a 
heterologous polypeptide. Examples of bacterial host cells include Escherichia coli, 

10 Bacillus subtilis, Salmonella typhimurium, Agrobacterium tumefaciens, Thermus 

thermophilus, and Rhizobium leguminosarum cells. The nucleic acid sequence is operably 
linked to the promoter which is controlled by a response regulator protein. In other 
words, the nucleic acid sequence is linked to the promoter sequence in a manner which 
allows for expression of the nucleotide sequence in vitro and in vivo. "Promoter" refers 

is to any DNA fragment which directs transcription of genetic material. The promoter is 
controlled by a response regulator protein, for example, ntrC, phoB, phoP, ompR, cheY, 
creB, or torR, of E. coli or its homologs from other bacterial species. Further, the 
response regulator protein can be another member of the cluster orthologous group 
(COG) COG0745 as defined by http://www.ncbi.nlm.nih.gov/COG/ (Tatusov et al 

20 Nucleic Acids Res. (2000); 28:33-36). In one implementation, the promoter is bound by 
E. coli ntrC. The term "ntrC" refers to both the E. coli ntrC protein (SWISSPROT : 
P06713, http://www.expasy.ch/) and its homologs in other bacteria as appropriate. As 
used herein, "bound" refers to a physical association with a equilibrium binding constant 
(Kd) of less than 100 nM, preferably less than 1 nM. An example of the promoter is the 

25 E. coli glnAp2 promoter, e.g. a region between positions about 93 and about 323 in the 
published DNA sequence, GenBank accession no. M10421(Reitzer & Magasanik (1985) 
Proc Nat Acad Sci USA 82:1979-1983). This region includes untranslated sequences from 
the glnA gene. Further, a translational fusion can be constructed between coding 
sequences for glnA and coding sequences for the heterologous polypeptide. 

30 The host cell is genetically modified such that the promoter is regulated by acetyl 

phosphate in the absence of nitrogen starvation. For example, the host cell can 
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genetically modified by deletion or mutation of a gene encoding a histidine protein 
kinase, e.g., a member of COG0642 as defined by (http://www.ncbi.nlm.nih.gov/COG/; 
Tatusov et al. supra.), e.g., glnL, phoR, phoQ, creC, or envZ. In another example, the 
histidine protein kinase has specificity for the response regulator protein which controls 

5 the promoter. The histidine protein kinase can be encoded by glnL, e.g., E. coli glnL 
(SWISSPROT P06712; http://www.expasy.ch/). 

Whereas the host cell is genetically modified such that the promoter is regulated 
by acetyl phosphate in the absence of nitrogen starvation, for heterologous polypeptide or 
metabolite expression, the host cell can be propagated in any desired condition, e.g., in 

10 nitrogen starvation conditions, nitrogen poor conditions, or nitrogen rich conditions. 

The heterologous polypeptide encoded by the nucleic acid sequence can be a 
biosynthetic enzyme required for production of a metabolite. It can be a mammalian 
protein, e.g., a secreted growth factor, a monoclonal antibody, or an extracellular matrix 
component. In yet another example, the heterologous polypeptide can be a desired 

15 antigen for use in a vaccine, e.g., a surface protein from a viral, bacterial, fungal, or 
protist pathogen. 

Another aspect of the invention features a kit containing a nucleic acid sequence 
which includes a promoter controlled by a response regulator protein. The kit further 
optionally contains a bacterial host cell which is genetically modified such that the 

20 promoter is regulated by acetyl phosphate in the absence of nitrogen starvation. The kit 
can also provide instructions for their use. The nucleic acid sequence can contain a 
restriction enzyme polylinker located 3' of the promoter such that a sequence inserted 
into the polylinker is operably linked to the promoter which is controlled by a response 
regulator protein. In one implementation of the kit, the promoter is the E. coli glnAp 2 

25 promoter and the bacterial host cell is an E. coli cell containing a mutation or deletion of 
the glnL gene. 

Another aspect of the invention features a host cell containing a first expression 
cassette. The first expression cassette includes a promoter, such as any of those described 
above, and a nucleic acid sequence encoding an enzyme required for biosynthesis of a 
30 heterologous metabolite. As used herein, "enzyme" refers to a polypeptide having ability 
to catalyze a chemical reaction or multiple reactions. The nucleic acid sequence is 
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operably linked to the promoter which is regulated by acetyl phosphate in the absence of 
nitrogen starvation. The host cell also contains additional nucleic acid sequences for 
expressing other enzymes required for biosynthesis of the metabolite. Such additional 
sequences may be endogenous sequences expressing endogenous enzymes, or introduced 
sequences expressing heterologous enzymes. 

In one example, the heterologous metabolite is an isoprenoid, a 
polyhydroxyalkanoate, a polyketide, a pMactam antibiotic, an aromatic, or a precursor, 
e.g., an upstream metabolite, or a derivative, e.g., a downstream metabolite, thereof. For 
instance, the isoprenoid can be a carotenoid, a sterol, a taxol, a diterpene, a gibberellin, 
and a quinone. Specific examples of isoprenoids include isopentyl diphosphate, 
dimethylallyl diphosphate, geranyl diphosphate, farnesyl diphosphate, geranylgeranyl 
diphosphate, and phytoene. Specific examples of carotenoids include P-carotene, C,- 
carotene, astaxanthin, zeaxanthin, zeaxanthin-P-glucoside, phytofluene, neurosporene, 
lutein, and torulene. When the desired heterologous metabolite is an isoprenoid, the 
heterologous enzyme can be isopentenyl diphosphate isomerase, geranylgeranyl 
diphosphate synthase, or 1 -deoxyxylulose 5-phosphate synthase. When the desired 
heterologous metabolite is an polyhydroxyalkanoate, the heterologous enzyme can be 3- 
ketoacyl reductase, or poly-3-hydroxyalkanoate polymerase. 

The host cell can be a bacterial cell, e.g., an E. coli cell. The host cell is 
optionally genetically modified by deletion or mutation of a gene, e.g., a gene encoding a 
histidine protein kinase, as described above. In one specific example, the host cell further 
contains a second expression cassette containing a nucleic acid sequence encoding 
phosphoenolpyruvate synthase operably linked to a promoter regulated by acetyl 
phosphate concentration, e.g., glnAp 2 . 

Another aspect of invention features a method of producing heterologous 
isoprenoids in a host cell. The method includes overexpressing phosphoenolpyruvate 
synthase and expressing biosynthetic enzymes required for synthesis of the heterologous 
isoprenoid. In one implementation, a gene in the host cell encoding a pyruvate kinase or 
a phosphoenolpyruvate carboxylase is genetically deleted or enfeebled. In another 
implementation, a gene encoding phosphoenolpyruvate carboxykinase is overexpressed 
in the host cell. Still another aspect of the invention features a method of producing a 
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lycopene in a host cell. The method includes expressing the following heterologous 
enzymes: 1-deoxy-D-xylulose 5-phosphate synthase, a geranylgeranyl diphosphate 
synthase, a phytoene synthase, and a phytoene saturase. In one implementation of this 
method, an isopentenyl diphosphate isomerase is overexpressed, e.g., using the glnAp2 
promoter. In another implementation, a phosphoenolpyruvate synthase is overexpressed, 
e.g., using the glnAp2 promoter. 

Another aspect of the invention features a nucleic acid sequence containing a 
promoter and a sequence encoding a biosynthetic enzyme required for the production of a 
first metabolite. The promoter is operably linked to the sequence, and is regulated by a 
second metabolite whose concentration is indicative of availability of a precursor for the 
biosynthesis of the first metabolite. In one example, the second metabolite is a waste 
product produced from a precursor for the biosynthesis of the first metabolite. 

In one implementation, the first metabolite is a polyhydroxyalkanoate, e.g., 
polyhydroxybutyrate and the nucleic acid sequence encodes a biosynthetic enzyme, e.g., 
a 3-ketoacyl coenzyme A (coA) reductases, or a poly-3-hydroxyoctanoyl-CoA 
polymerase. In another case, the first metabolite is a polyketide, a p-lactam antibiotic, or 
an aromatic. In a yet another case, the first metabolite is an isoprenoid, e.g., an 
isoprenoid mentioned herein. The nucleic acid sequence can encode a biosynthetic 
enzyme required for isoprenoid production, e.g., isopentenyl diphosphate isomerase, 
geranylgeranyl diphosphate synthase, 1-deoxyxylulose 5-phosphate synthase, 
phosphoenolpyruvate synthase, farnesyl diphosphate synthase, geranylgeranyl 
diphosphate synthase, phytoene synthase, phytoene desaturase, or lycopene cyclase. One 
precursor of isoprenoids can be pyruvate. Pyruvate concentrations are related to acetate 
and acetyl-phosphate concentrations. Accordingly, in this instance, the second 
metabolite is acetyl phosphate. The promoter responding to acetyl phosphate can be 
controlled by a response regulator protein, e.g., a response regulator protein mentioned 
above. Such a promoter may only respond to acetyl phosphate in a specific host cell. In 
a particular example, the promoter responding to acetyl phosphate concentration is bound 
by E. colt ntrC, e.g., E. coli glnAp 2 promoter. 

The promoter can be regulated by cAMP. The promoter can be a bacterial 
promoter which binds CAP (catabolite activator protein). In mammals, the promoter can 
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be a promoter containing a cAMP response element (CRE), which binds to the proteins 
CREB, CREM, or ATF-1. In yeast cells, the promoter can be a promoter regulated by 
cAMP, or a promoter bound by proteins Gisl, Msn2, or Msn4. Another possible 
regulatory signal for the promoter can be fructose 1 -phosphate, or fructose 6-phosphate. 
The E, coli FruR protein regulates such promoters. 

The nucleic acid sequence can be contained on a plasmid. It can also contain a 
bacterial origin of replication and a selectable marker. The sequence can further contain 
a yeast or other eukaryotic origin of replication and appropriate selectable markers, and 
can be integrated into the genome. 

The optimization of biosynthesis of heterologous compounds in host cells is 
reliant on sensing parameters of cell physiology and on utilizing these parameters to 
regulate the biosynthesis. One standard techniques in the art is to grow cells and for the 
user to exogenously add an agent, e.g., an inducer, to turn on genes required for 
biosynthesis of the desired product. It has been widely observed that high-level induction 
of a recombinant protein or pathway leads to growth retardation and reduced metabolic 
activity. (Kurland and Dong ( 1 996) Mol Microbiol 21:1-4). The practice of 
exogenously supplying an inducer is empirical and does not monitor the availability of 
resources in the cell for biosynthesis. In contrast, natural pathways rely on feedback 
mechanisms to control such processes. The combination of certain promoters with 
specific genetically defined host cells and heterologous polypeptides in this invention 
unexpectedly results in a highly refined and versatile control circuit that regulates flux to 
heterologous polypeptide or metabolite synthesis in response to the metabolic state of the 
cell. Indeed, the dynamically controlled recombinant pathway provides for enhanced 
production, minimized growth retardation, and reduced toxic by-product formation. The 
regulation of gene expression in response to physiological state will also benefit other 
applications, such as gene therapy. 

The details of one or more embodiments of the invention are set forth in the 
description below. Other features, objects, and advantages of the invention will be 
apparent from the description and from the claims. 
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Detailed Description 

The invention provides methods of engineering metabolic control, e.g., methods 
of utilizing promoters in specific host cells in order to optimize protein expression for 
either protein production or metabolite synthesis. 

5 A central component of the invention is an expression cassette comprising a 

promoter and nucleic acid sequence encoding a heterologous polypeptide whose 
expression is desired. The expression cassette is constructed using standard methods in 
the art such that the coding nucleic acid sequence is operably linked, e.g., regulated by, 
the promoter. The promoter is chosen such that the promoter is regulated by a parameter 

10 of cell physiology or cell metabolic state. A variety of promoters can be used. In some 
applications the expression cassette is contained within a plasmid, such as bacterial 
plasmid with a bacterial origin of replication and a selectable marker. The expression 
cassette can be integrated into the genome of cells using standard techniques in the art. 
If the expression cassette is to be used for engineering regulated production of a 

15 heterologous polypeptide during late logarithmic growth or during stationary phase, then 
the promoter can be chosen accordingly. For example, a promoter can be chosen that 
responds to small molecule signal, e.g., a second messenger, whose levels accumulate 
during late logarithmic growth or during stationary phase. The second messenger can be 
a molecule that accumulates as a precursor, an intermediate, or a waste product of a 

20 biochemical pathway. In bacteria, the small molecule signal can be a glycolysis 

intermediate, e.g., fructose 1 -phosphate or fructose 6-phosphate or a glycolysis waste 
product, e.g., acetate or acetyl phosphate. In eukaryotic cells, cAMP concentrations are a 
well known signal of nutrient state. 

The promoter in the expression cassette can be chosen based on the results of a 

25 large scale expression analysis experiment, e.g., a gene chip experiment. Genes which 
are induced by acetyl phosphate can be identified by hybridizing to a microarray labeled 
cDNA prepared from cells in grown in acetate and comparing the signal to a reference 
signal, e.g., to the signal of obtained with cDNA prepared from cells in early logarithmic 
growth. This experiment can be performed on both prokaryotic and eukaryotic cells, e.g., 

30 bacterial, yeast, plant and mammalian cells. For an example of such an experiment in a 
prokaryote, see Talaat et al. (2000) Nat Biotechnol 18:679-82 and Oh & Liao (2000) 
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Biotechnol Prog. 16:278-86. Once a gene is identified which is expressed under the 
desired condition, its promoter can utilized in the expression cassette. Alternatively, the 
experiment can be performed by the exogenous addition of a desired molecule (e.g., a 
precursor in a metabolic pathway) or by manipulation of experimental conditions (e.g., 
growth to late logarithmic phase or growth while a biosynthetic enzyme is 
overproduced). Promoters can be identified based on the genes induced. 

In one instance, an expression cassette is used for engineering regulated 
production of a metabolite in a bacterial cell. The promoter can be selected which is 
regulated by a second metabolite whose concentration is indicative of the availability of a 
precursor for the biosynthesis of the first metabolite. For example, if the first metabolite 
is an isoprenoid which is synthesized from the precursors, pyruvate and glyceraldhyde 3- 
phosphate, then the second metabolite can be acetyl phosphate. In a rich environment, 
cells produce an excess amount of acetyl-CoA, a product of pyruvate. The excess 
acetyl-CoA is used to produce ATP and acetate, which is secreted as a waste product. 
Acetate concentration increases with cell density. Acetate, acetyl-CoA, and acetyl- 
phosphate concentrations are interrelated by to the following biochemical reactions: 

(1) acetyl-CoA + Pj <r^ acetyl phosphate + Co A 

(2) acetyl phosphate + ADP <-> acetate + ATP 

Thus, high acetyl phosphate concentration is indicative of excess acetyl-CoA and 
excess pyruvate. A host cell which is genetically modified by deletion or mutation of 
glnL, for example, causes ntrC function to become acetyl phosphate dependent (Feng et 
al (1992) J Bacteriol 174:6061-6070). In this fashion, a promoter regulated by ntrC, 
e.g., the glnAp2 promoter, can be used to control gene expression in response to acetyl 
phosphate. The glnAp2 promoter can be obtained using standard techniques in the art. 
For example, primers can be designed and synthesized that anneal to the glnAp2 
promoter. The polymerase chain reaction (PCR) can be used to amplify a nucleic acid 
fragment containing the glnAp2 promoter. This fragment can now be used for further 
constructions. Likewise, an E. coli strain containing deletion of histidine protein kinase 
gene, e.g., glnL can be easily prepared. See Link et al (1997) J Bacteriol 179(20):6228- 
6237 for a detailed description of one possible method. The sequences encoding a 
desired heterologous polypeptide can be cloned downstream of the glnAp2 promoter so 
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that it is operably linked to the promoter. A host cell with an inactivated glnL gene can 
then be transformed with the sequences. The transformed strain can be grown, and 
polypeptide production monitored during the course of growth. Robust protein 
expression can be observed at high cell densities, as in Farmer and Liao (2000) Nat. 
Biotechnol 18:533-537, the contents of which are hereby incorporated by reference. 

A mammalian cell can be used as a host cell for polypeptide or metabolite 
production. A promoter can be selected, e.g., a promoter that responds to cAMP. Such a 
promoter can contain a cAMP response element (CRE), which binds to the proteins 
CREB, CREM, or ATF-1. Using standard techniques in the art, a desired coding 
sequence can be placed under control of the promoter and transformed into the 
mammalian cell. In some instances, the construction can be inserted into a virus, e.g., an 
inactivated virus. Such implementations allow for the regulated production of a protein 
or a metabolite produced by a heterologous biosynthetic enzyme in a gene therapy 
scenario. Plant cells can also be used as host cells. Again, an appropriate promoter can 
be chosen, e.g., a promoter than responds to a plant hormone, metabolite, or a precursor 
for the production of a desired metabolite. A promoter can be identified by a microarray 
experiment. After fusion of a desired promoter to a desired coding sequence in an 
appropriate vector, the construction can be electroporated into Agrobacterium 
tumefaciens and then used to transform plant cells using standard methods in the art. In 
still another example, yeast cells can be manipulated to express heterologous 
polypeptides or metabolites under metabolic control. For example, a Saccharomyces 
cerevisiae promoter can be a promoter regulated by cAMP, e.g., a promoter bound by 
proteins Gisl, Msn2, or Msn4. The regulation of all yeast genes in response to a variety 
of metabolic conditions is increasingly well studied. For example, DeRisi et al (1997) 
Science 275:680-686 describe experiments following the transcriptional profile of nearly 
the entire Saccharomyces cerevisiae gene set under various metabolic conditions. 
Promoters regulated by a desired metabolite can be selected based on such data. The 
generation of yeast plasmids and the transformation of yeast are well known in the art. 

A variety of metabolic pathways can be reconstructed using the expression 
techniques described above. For example, a pathway to produce lycopene can be 
introduced in E. coli by constructing expression vectors for the following genes: dxs 
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(coding for 1 -deoxy-D-xylulose 5-phosphate synthase) from E. coli, gps (coding for 
geranylgeranyl diphosphate (GGPP) synthase) from Archaeoglobusfulgidus, and crtBI 
(coding for phytoene synthase and desaturase, respectively) from Erwinia uredovora. 
These genes can reside on a single or multiple plasmids, or can be integrated into the E. 
coli chromosome. In addition, phosphoenolpyruvate synthase can be overexpressed 
using any method, e.g., by fusion to the glnAp2 promoter. Isopentyl diphosphate 
isomerase can be overexpressed using any method, e.g., by fusion to the glnApl 
promoter. 

In another example, a pathway to produce polyhydroxyalkanoates (PHA), e.g., 
polyhydroxybutyrate can be implemented in E. coli. PHA is a family of linear polyesters 
of hydroxy acids with a variety of thermoplastic properties and commercial uses. 
Pseudomonas aeruginosa genes encoding 3-ketoacyl coenzyme A reductases and poly-3- 
hydroxyalkanoate polymerase can be placed under regulation of a desired promoter, e.g., 
glnAp2, since acetyl-CoA levels can be indicative of precursor availability for PHA 
synthesis. 

Without farther elaboration, it is believed that the above description has 
adequately enabled the present invention. The following examples are, therefore to be 
construed as merely illustrative, and not limitative of the remainder of the disclosure in 
any way whatsoever. All publications cited herein are hereby incorporated by reference 
in their entirety. 

Methods 

Growth conditions. All E. coli strains were grown in shake flasks containing the 
designated medium at 37°C in waterbath shakers (Model G76; New Brunswick 
Scientific, Edison, NJ). The cultures were grown in minimal media consisting of either 
M9 defined salts 34 containing 0.5% (wt/vol) glucose or YE defined salts containing 
1.5% (wt/vol) glucose. YE defined salts consisted of (per liter) 14 g K 2 HP0 4 , 16 g 
KH 2 P0 4 , 5 g (NH 4 ) 2 S0 4 , 1 g MgS0 4 , and 1 mg thiamine. Cell turbidity was monitored 
spectrophotometrically at 550 nm. 

Metabolite measurements. Acetate, pyruvate, and other organic acids were measured 
using HPLC (Constametric 3500 Solvent Delivery System and Spectromonitor 3 1 00 
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Variable Wavelength Detector; LDC Analytical, Riviera Beach, FL) over an organic 
acids column (Aminex HPX-87H, Bio-Rad Laboratories, Hercules, CA) maintained at 
65°C. The mobile phase consisted of 0.01 N H2S04, and its flow rate was kept at 0.6 
ml min' 1 . Peaks coming off the column were detected at 210 ran. Glucose was measured 
using Sigma kit no. 315-100. To quantify lycopene, 1 ml of bacterial culture was 
extracted with acetone, centrifuged, and the supernatant absorbance was measured at 474 
nm. Lycopene concentrations were calculated by comparing absorbances to a standard 



curve. 



SDS-PAGE and enzyme assays. The protocol for SDS-PAGE is as described by 
Laemmli (1970) Nature 227:680-685. Measurement of p-galactosidase activity was 
carried out essentially as described by Miller (1992) A Short Course in Bacterial 
Genetics, Cold Spring Harbor Laboratory Press, Cold Spring Harbor NY. 

Results 

Usage of the glnAP2 promoter in E. coli in a heterologous fusion to lacZ. 

Increasing levels of acetyl phosphate can be an indicator of excess glucose flux. 
The current invention features host cells, nucleic acids sequences, and methods of 
utilizing acetyl phosphate as a signal to regulate the expression of rate-controlling 
enzymes in a desired metabolic pathway, both to utilize fully the excess carbon flux and 
to redirect the flux away from the toxic product, acetate. 

In order to examine the potential of glnAp2 as a dynamic controller of product 
expression, a nucleic acid sequence was constructed containing a heterologous lacZ gene 
operably linked to the glnAp 2 promoter. The glnAp2 promoter region containing the 
promoter and two ntrC-binding sites can be easily obtained by standard methods known 
in the art. The glnAp2 promoter was PCR-amplified from E. coli genomic DNA using 
the forward primer 5'-CAGCTGCAAAGGTCATTGCACCAAC (containing an 
engineered Pvull site) and the reverse primer 5'-GGTACCAGTACGT- 
GTTCAGCGGACATAC (containing an engineered Kpnl site). These two primers 
amplified a region between positions 93 and 343 in the published DNA sequence 16 
(GenBank accession No. Ml 0421). 
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The glnAp2 PCR fragment was also cloned into the £coRI site of pRS55 1 , thus 
generating p2GFPuv, which contains glnAp2 in front of a promoterless lacZ gene. The 
glnAp2-lacZ region was transferred to fcRS45 via homologous recombination (Simons et 
a/.(1987) Gene 53:85-96), generating phage X.p2GFPuv. JCL1595 and JCL1596 were 
constructed by integrating a glnAp2-lacZ fusion via infection (Silhavy et al. (1984) 
Experiments with Gene Fusions, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor NY) with Xp2GFPuv phage into the chromosomes of BW1371 1 (lacX74) and 
BW18302 (lacXglnL2001; Feng et al. supra), respectively. This strain contains the 
glnL2001 allele, which consists of an internal deletion between codons 23 and 182 of the 
glnL coding sequence and presumably results in a null mutation (Feng et al. supra). 

The time course of the p-galactosidase (p-gal) activity was measured in wild-type 
and in the glnL mutant. The glnAp2-$-ga\ activity increases in a time-dependent fashion 
similar to the excreted acetate concentration from the glnL host (JCL1596), whereas no 
induction of promoter activity was found for the isogenic wild-type control (JCL1595). 
Table 1. P-galactosidase activity of glnAp2-lacZ 





P-galactosidase activity 




(nmol/min-mg protein) 




6 hours 1 1 hours 


g lnAp2-lacZ in WT (JCL1 595) 


<ioo ~ioo 


glnAp2-lacZ in glnL (JCL1 596) 


-700 -1500 


P hc -lacZ in (VJS632) 


-500 -550 



Thus, in the absence of glnL, glnAp2 is capable of responding to the excess 
carbon flux that is indicated by acetate excretion. As the cells approached the late- 
exponential phase, the biosynthetic requirement decreased and the cells began to exhibit 
an excess carbon flux, as demonstrated by the increased generation of acetate. At this 
point, at approximately 6 hours, unexpectedly glnAp2-$-gal activity began to rise to 
(-700 nmol/min-mg protein, see Table 1) whereas glnAp2-$-gd\ activity in the wild-type 
strain (JCL1595) was relatively low and remained constant throughout (-100 nmol/min- 
mg protein, Table 1). After more than 10 hours, glnAp2-$-ga\ activity in the absence of 
glnL was a remarkable -1500 nmol/min-mg protein (Table 1). The induction profile of 
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glnAp2 is also in stark contrast to that of the lac promoter (P /flC ). Chromosomal ? lac 
activity in strain VJS632 (lac + ) rapidly increased after induction with IPTG (isopropyl-p- 
D-thio-galactopyranoside) and achieved a constant level of expression in the cell (-550 
nmol/min-mg protein, see Table 1), which is independent of the growth phase. 

Usage of the glnAP2 promoter in E. coli in a heterologous fusion to pps and aroG 

Expression of two different metabolic enzymes, phosphoenolpyruvate synthase 
(pps) and 3-deoxy-D-arabinoheptulosonate 7-phosphate (DAHP) synthase (aroG) were 
placed under the control of the glnAp2 promoter. As controls, these same two proteins 
also were overexpressed from the tac promoter (P tec ), which exhibits static control, under 
the same genetic background and environmental conditions. Standard methods of 
expressing jo/ay leads to growth retardation (Patnaik et al (1992) J Bacteriol 174:7527- 



Plasmid pAROG was constructed by cloning a PCR fragment containing aroG 
pRWStkt into the EcoKl-Bamm sites of pJFl 18EH. Plasmid pPS706 has been 
previously described in Patnaik et al supra. Both plasmid express the respective genes 
under the P tac promoter. The PCR fragment containing the glnAp2 promoter was cloned 
into the EcoRV-EcoRI sites of plasmids pAROG, and pPS706 to generate plasmids 
p2AROG3, and pPSG706, respectively containing the respective genes under the glnAp2 
promoter. 

Host strain BW18302 (lacX glnL2001) was transformed with all four plasmids. 
The strains with the respective plasmids were grown in M9 salts-glucose media. Growth 
was compared after 5 hours. 



7532). 
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Table 2. Growth of Overexpressing Strains 


OD550 after 5 hours growth 


No plasmid 


-0.5 


Ptac-aroG 


-0.5 


glnAp2-aroG 


-0.5 


Piac-pps 


-0.12 


glnAp2-pps 


-0.4 



As previously demonstrated, overexpression of pps using P ta c-pps caused marked 
growth retardation. However, the use of glnAp2 unexpectedly resulted in close to normal 
growth (Table 2). After 15 hours, proteins were isolated from each strain and analyzed 
on a 10% SDS-PAGE gel. At least 500% more pps protein was expressed when the pps 
gene was controlled by the glnApl promoter compared to the P tac promoter. In another 
surprising finding, AroG protein, whose conventional overexpression is not overtly 
detrimental, was also at least 300% more abundant in extracts from cells utilizing glnAp2 
promoter for expression compared to the P ta c promoter. 

Production of Lycopene in E. coli by idi Overexpression 

We reconstructed a recombinant lycopene pathway in E. coli by expressing the 
genes dxs (coding for 1-deoxy-D-xylulose 5-phosphate synthase) from E. coli, gps (coding 
for geranylgeranyl diphosphate (GGPP) synthase) from Archaeoglobus fulgidus, and 
crtBI (coding for phytoene synthase and desaturase, respectively) from Erwinia 
uredovora. These genes were inserted into pCL1920, a low-copy-number plasmid, to 
form pCW9, and simultaneously overexpressed. 

We used the glnApl promoter to control the expression of idi (isopentenyl 
diphosphate isomerase). Constructs containing the idi gene were derived from a 
promoterless vector, pJFl 1 8, The glnApl promoter was inserted to form p2IDI. As a 
control, the P tac promoter was inserted to form pTacIDI. These plasmids were separately 
introduced into a glnL strain (BW1 8302) containing pCW9. The p2IDI-containing strain 
(glnApl -idi) produced 100 mg L" 1 lycopene after 26 h in a defined medium containing 
glucose. The strain containing V tac -idi on the other hand, produced only a small amount 
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of lycopene, (< 5 mg L" 1 ) under identical conditions. Additionally, the p2IDI strain 
produced almost threefold less acetate than pTacIDI, which indicates that the carbon flux 
to acetate was being rechanneled to lycopene. 



Table 3. Carbon yield of lycopene formation in batch cultures of E. coli. 



Lycopene Carbon yield on glucose 
(mol C/mol C) 


Host only (BW18302) 


0.0000 


+ pTacIDI (Ptac-K/i) 


0.0003 


+ pTacIDI (Ptac-/</j) / pPS184 (Ptac-p/w) 


0.0012 


+ p2IDI (glnAp 2 -idi) 


0.014 


+ p2IDI (glnAp 2 -idi) 1 pPSGl 84 (glnAp 2 -pps) 


0.022 



Use of pps to Enhance Lycopene Yield 

pps was overexpressed from glnAp2 from another compatible plasmid, pPSG18 while the 
remainder of the lycopene pathway (dxs, gps, crtBI) was expressed using pCL1920. 
Coexpression of pps and idi with the lycopene pathway increased the final titer of 
lycopene by 50% and caused the productivity to increase by threefold, from 0.05 mg 
mL" 1 h" 1 to 0.16 mg ml/ 1 h" 1 (Table 3) This is in contrast to the companion strain 
containing both pTacIDI and pPS184 (P lac -idi + P, ac -pps), where no significant 
improvement in yield was observed and substantial growth inhibition occurred. 

Additional Host Cells for Lycopene Production 

The pykF::cat and pykAr.kan alleles were introduced into a wild-type strain, in order to 
generate two single mutants (JCL1610 (pykF) and JCL1612 (pykA)) and one double 
mutant strain (JCL1613 (pykFpykA)) (Ponce etal. (1995) J Bactehol\ll:51\9-S122). 
The double mutant strain was able to achieve a final lycopene titer of about 14 mg 
lycopene/g dried cells, while the single mutant strains each obtained lycopene titers of 
about 2.5 mg lycopene/g dried cells. The single pyk mutants produced lycopene at a level 
similar to the wild type strain,- 4 mg lycopene/g dried cells. Further, overexpression of 
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Pck, phosphoenolpyruvate carboxykinase , increased the final lycopene titer by about 3- 
fold. Overexpression of Ppc, phosphoenolpyruvate carboxylase, reduced lycopene 
production by about 30%. 



A number of embodiments of the invention have been described. Nevertheless, it 
will be understood that various modifications may be made without departing from the 
spirit and the scope of the present invention. Accordingly, other embodiments are within 
the scope of the following claims. For example, all homologs of the mentioned 
polypeptides and genes are within the scope of this invention. 



Other Embodiments 
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